Enhancing Phoneme Recognizer Performance with a Simple Rule-based Language Model
نویسندگان
چکیده
The phoneme classification inaccuracy at the acoustic phonetic level is a major weakness in most speech recognition systems. However, the inaccuracy will violate phonotactic constraints at the acoustic phonetic level. A better performance is expected if a language model is adopted in a recognition system for post-processing phoneme estimates and making corrections with a set of explicit rules of the language used. We developed a simple language model with a set of rules to correct phoneme classification errors made by a Hidden Markov Model based phoneme recognizer. The experimental results indicate that about 20% of recognition errors can be corrected.
منابع مشابه
State-Dependent Phoneme-Based Model Merging for Dialectal Chinese Speech Recognition
Aiming at building a dialectal Chinese speech recognizer from a standard Chinese speech recognizer with a small amount of dialectal Chinese speech, a novel, simple but effective acoustic modeling method, named statedependent phoneme-based model merging (SDPBMM) method, is proposed and evaluated, where a tied-state of standard triphone(s) will be merged with a state of the dialectal monophone th...
متن کاملImproved Classification of Speaking Styles for Mental Health Monitoring Using Phoneme Dynamics
This paper investigates the usefulness of segmental phonemedynamics for classification of speaking styles. We modeled transition details based on the phoneme sequences emitted by a speech recognizer, using data obtained from a recording of 39 depressed patients with 7 different speaking styles normal, pressured, slurred, stuttered, flat, slow and fast speech. We designed and compared two set of...
متن کاملAutomatic Recognition of Lyrics in Singing
The paper considers the task of recognizing phonemes and words from a singing input by using a phonetic hidden Markov model recognizer. The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music. Due to the lack of annotated singing databases, the recognizer is trained using speech and ...
متن کاملA hybrid word / phoneme-based approach for improved vocabulary-independent search in spontaneous speech
For efficient organization of speech recordings – meetings, interviews, voice mails, and lectures – being able to search for spoken keywords is essential. Today, most spoken document retrieval systems use large-vocabulary recognition. For the above scenarios, such systems suffer from the unpredictable domain, out-ofvocabulary queries, and generally high word-error rate (WER). In [1], we present...
متن کاملDevelopment of Multilingual Acoustic Models in the GlobalPhone Project
This paper describes our recent eeort in developing the Glob-alPhone recognizer for multilingual large vocabulary continuous speech. Turkish. Based on ve languages we developed a global phoneme set and built multilingual speech recognizer by variing the method of acoustic model combination. Context dependent phoneme models are created using questions about languages and language groups. Results...
متن کامل